perf: replace OFFSET pagination with ID-range batching in async jobs by kneckinator · Pull Request #148 · OpenSPP/OpenSPP2

kneckinator · 2026-04-02T08:01:23Z

Summary

Replace OFFSET-based pagination in all async job dispatchers with NTILE-based ID-range batching
OFFSET N causes PostgreSQL to scan and discard N rows, making later batches O(N) slower — with 1M records and batch size 2000, the last batch scans ~1M rows
ID-range batching uses WHERE id BETWEEN min_id AND max_id, which is O(1) via the primary key index regardless of batch position
Add min_id/max_id support to get_beneficiaries() on both spp.program and spp.cycle

Changes

pagination_utils.py (new): compute_id_ranges() helper using PostgreSQL NTILE window function to pre-compute (min_id, max_id) boundaries in a single SQL query
program_manager.py: _enroll_eligible_registrants_async() dispatches jobs with ID ranges instead of offset/limit
cycle_manager_base.py: _check_eligibility_async() and _prepare_entitlements_async() use ID ranges
cycle_manager.py: Updated CustomDefaultCycleManager._prepare_entitlements() override to forward min_id/max_id
programs.py / cycle.py: get_beneficiaries() supports min_id/max_id for range queries
test_keyset_pagination.py (new): 15 tests covering the helper, get_beneficiaries range queries, and async dispatch integration

Context

Phase 6 of 9 in the spp_programs performance optimization effort. Rebased on current 19.0. Version bumped to 19.0.2.0.8.

Test plan

./scripts/test_single_module.sh spp_programs — 606 tests, 0 failures
pre-commit run --files <changed_files> — all checks pass
For async operations: test via UI with ODOO_INIT_MODULES=spp_programs docker compose --profile ui up -d

gemini-code-assist

Code Review

This pull request introduces ID-range based keyset pagination to replace inefficient OFFSET-based pagination for asynchronous job dispatching. It adds a new utility, compute_id_ranges, which utilizes PostgreSQL's NTILE function to calculate batch boundaries, and updates the get_beneficiaries methods and various managers in the spp_programs module to support min_id and max_id parameters. Feedback was provided regarding a potential race condition in the pagination utility that could lead to a TypeError if records are deleted between database queries.

codecov · 2026-04-02T08:05:50Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 71.35%. Comparing base (21089c5) to head (b805be1).
⚠️ Report is 5 commits behind head on 19.0.

Additional details and impacted files

@@            Coverage Diff             @@
##             19.0     #148      +/-   ##
==========================================
+ Coverage   71.24%   71.35%   +0.11%     
==========================================
  Files         931      932       +1     
  Lines       54742    54771      +29     
==========================================
+ Hits        38999    39082      +83     
+ Misses      15743    15689      -54

Flag	Coverage Δ
spp_api_v2	`80.10% <ø> (ø)`
spp_api_v2_change_request	`66.85% <ø> (ø)`
spp_api_v2_cycles	`71.12% <ø> (ø)`
spp_api_v2_data	`64.41% <ø> (ø)`
spp_api_v2_entitlements	`70.19% <ø> (ø)`
spp_api_v2_gis	`71.52% <ø> (ø)`
spp_api_v2_products	`66.27% <ø> (ø)`
spp_api_v2_service_points	`70.94% <ø> (ø)`
spp_api_v2_simulation	`71.12% <ø> (ø)`
spp_api_v2_vocabulary	`57.26% <ø> (ø)`
spp_audit	`72.60% <ø> (+0.06%)`	⬆️
spp_base_common	`90.26% <ø> (ø)`
spp_case_entitlements	`97.61% <ø> (ø)`
spp_case_programs	`97.14% <ø> (ø)`
spp_cel_event	`85.11% <ø> (ø)`
spp_claim_169	`58.11% <ø> (ø)`
spp_dci_client_dr	`55.87% <ø> (ø)`
spp_dci_client_ibr	`60.17% <ø> (ø)`
spp_programs	`63.50% <100.00%> (+0.93%)`	⬆️
spp_security	`66.66% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
spp_programs/__manifest__.py	`0.00% <ø> (ø)`
spp_programs/models/cycle.py	`65.03% <100.00%> (+0.36%)`	⬆️
spp_programs/models/managers/cycle_manager.py	`78.26% <100.00%> (ø)`
spp_programs/models/managers/cycle_manager_base.py	`66.36% <100.00%> (+11.24%)`	⬆️
spp_programs/models/managers/pagination_utils.py	`100.00% <100.00%> (ø)`
spp_programs/models/managers/program_manager.py	`92.74% <100.00%> (+14.40%)`	⬆️
spp_programs/models/programs.py	`87.95% <100.00%> (+0.10%)`	⬆️

... and 1 file with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

gonzalesedwin1123 · 2026-04-06T06:56:55Z

@kneckinator is this really not intended to be added to the init.py?

OFFSET N causes PostgreSQL to scan and discard N rows, making later batches progressively slower. This replaces all async job dispatchers with NTILE-based ID-range batching that uses WHERE id BETWEEN min_id AND max_id, which is O(1) via the primary key index.

Handle the unlikely case where records are deleted between the COUNT and MIN/MAX queries, which would cause a TypeError on None values.

Cover the _check_eligibility_async, _prepare_entitlements_async, and the isinstance(states, str) branch in _enroll_eligible_registrants_async.

gemini-code-assist Bot reviewed Apr 2, 2026

View reviewed changes

Comment thread spp_programs/models/managers/pagination_utils.py Outdated

kneckinator force-pushed the worktree-perf+phase6-keyset-pagination branch 3 times, most recently from ffb1394 to b4b9ddc Compare April 2, 2026 08:17

kneckinator requested a review from gonzalesedwin1123 April 3, 2026 02:24

gonzalesedwin1123 reviewed Apr 6, 2026

View reviewed changes

gonzalesedwin1123 approved these changes Apr 6, 2026

View reviewed changes

kneckinator and others added 4 commits April 17, 2026 10:28

fix: guard against race condition in compute_id_ranges

c16fb31

Handle the unlikely case where records are deleted between the COUNT and MIN/MAX queries, which would cause a TypeError on None values.

test: add coverage for async dispatch methods

81b4a5a

Cover the _check_eligibility_async, _prepare_entitlements_async, and the isinstance(states, str) branch in _enroll_eligible_registrants_async.

docs: bump version to 19.0.2.0.8, add changelog for ID-range pagination

b805be1

gonzalesedwin1123 force-pushed the worktree-perf+phase6-keyset-pagination branch from b4b9ddc to b805be1 Compare April 17, 2026 02:31

gonzalesedwin1123 merged commit 26b9060 into 19.0 Apr 17, 2026
34 checks passed

gonzalesedwin1123 deleted the worktree-perf+phase6-keyset-pagination branch April 17, 2026 02:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: replace OFFSET pagination with ID-range batching in async jobs#148

perf: replace OFFSET pagination with ID-range batching in async jobs#148
gonzalesedwin1123 merged 4 commits into19.0from
worktree-perf+phase6-keyset-pagination

kneckinator commented Apr 2, 2026 •

edited by gonzalesedwin1123

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

codecov Bot commented Apr 2, 2026 •

edited

Loading

Uh oh!

gonzalesedwin1123 Apr 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kneckinator commented Apr 2, 2026 • edited by gonzalesedwin1123 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Context

Test plan

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

codecov Bot commented Apr 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

gonzalesedwin1123 Apr 6, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kneckinator commented Apr 2, 2026 •

edited by gonzalesedwin1123

Loading

codecov Bot commented Apr 2, 2026 •

edited

Loading